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l/^ . Abstract 

ly^ ' This paper considers the muhi-input muhi-output (MIMO) relay channel where multiple antennas are 

o 

I employed by each terminal. Compared to single-input single-output (SISO) relay channels, MIMO relay 



o 



X 



channels introduce additional degrees of freedom, making the design and analysis of optimal cooperative 
strategies more complex. In this paper, a partial cooperation strategy that combines transmit-side message 
splitting and block-Markov encoding is presented. Lower bounds on capacity that improve on a previously 
proposed non-cooperative lower bound are derived for Gaussian MIMO relay channelj^. 

Keywords - Relay channels, MIMO systems, superposition coding, dirty-paper coding. 



1 Introduction 

Mesh networks that support multihop communication form an integral part of future-generation wireless 

communications [3-5]. Relay channels are the fundamental building blocks of multihop mesh networks. 

From [6], a discrete memoryless relay channel is defined by {Xi x X2,p{y,yi\xi,X2),y x yi). Here, Xi, X2, 

yi and y are finite sets corresponding to the transmitter, the relay and the receiver as shown in Fig. [TJ 

^ These lower bounds are special cases of [6, Theorem 7], but are of interest as they are applied to the MIMO relay channel 
and provide intuition about the structure of good coding strategies for MIMO relaying. 
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Relay channels were introduced in [8] and upper bounds on their capacity were derived in [9] . Full-duplex 
relay channels were first analyzed from an information-theoretic perspective in [6], where inner and outer 
bounds were derived and exact capacity expressions were obtained for special cases such as the physically de- 
graded and Gaussian degraded relay channels. The information-theoretic analysis in [6] relied on cooperation 
between the transmitter and the relay induced by block-Markov encoding. 

Achievable rates in relay channels can be further improved via multi-input multi-output (MIMO) tech- 
nology [10-13]. It has been shown that the capacity of a MIMO channel can scale linearly as the minimum 
of the number of transmit and receive antennas [14]. This encouraging result has led to research on mul- 
tiuser MIMO channels such as Gaussian multiple access (MAC) [15-18] and broadcast (BC) [19-22] channels. 
Although discrete memoryless relay channels were analyzed decades ago, MIMO relay channels have only 
recently been studied [24]. As MIMO is an integral aspect of industry standards such as IEEE 802. 16e [2], 
and relaying is also being considered for practical implementation [1], it is natural to consider the perfor- 
mance limits of MIMO relaying. In particular, MIMO relaying has gained increasing attention recently, and 
results have been obtained in terms of capacity scaling laws in large networks [25] , capacity scaling laws for 
two-way relaying [26] and optimal precoder design [27]. 

In [24], a Gaussian relay channel with multiple antennas at each terminal is considered. Upper and lower 
bounds on capacity are shown for both deterministic and Rayleigh fading channels. The lower bounds for 
the case of fixed channels in [24] arise from a non-cooperative transmit strategy. Higher achievable rates 
than those yielded by the non-cooperative approach in [24] can be obtained by observing that MIMO relay 
channels inherently contain more degrees of freedom than single-input single-output (SISO) relay channels, 
where each terminal employs only a single antenna. 

We assume that the relay performs partial dccodc-and-forward operations, where the relay decodes a 
portion of the transmitter's codeword, encodes the decoded message using its own codebook, and sends the 
encoded message to the receiver. In a MIMO relay channel, the channel eigenmodes can be exploited to 
optimize the cooperative role of the relay. Thus, coding strategies such as transmit-side message splitting [6,7] 
can increase the achievable rate for MIMO relay channels. 

In Section V we consider a simple numerical example that illustrates the role that the channel eigenmodes 
play in optimizing the cooperative role of the relay for the MIMO case. One scenario that we consider in 
Section V involves the transmitter-to-relay vector channel being orthogonal to the vector direct link. In 
particular, this notion of orthogonality is a special case of the orthogonal relay channel considered in [35], 
where the general partial decode-and-forward strategy in [6, Theorem 7] is shown to be capacity-achieving 
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for both discrete-memoryless and Gaussian cases. For the Gaussian case, the cooperative role of the relay is 
optimized by power allocation over both components of the transmitter's codeword, which is analogous to 
power allocation over the channel eigenmodes for the MIMO case. 

We present transmission strategies that rely on message splitting to support varying levels of cooperation 
between the transmitter and the relay in a MIMO relay channel. In this policy, the transmitter has two 
messages and chooses its codeword as a function of both of them; the relay, though, only has to decode one of 
these messages. The key intuition behind the application of message splitting to MIMO relaying is as follows: 
for the Gaussian SISO relay channel, while message splitting increases the average throughput [33,34], it is not 
a capacity-achieving strategy. This is based on the fact that in a Gaussian SISO relay channel, the received 
signals at the relay and the receiver can be statistically ordered. On the other hand, the received signals 
in a Gaussian MIMO relay channel cannot be statistically ordered. In particular, the channel eigenmodes 
determine the optimal level of cooperation between the transmitter and the relay in a MIMO relay channel, 
which is measured by how the transmitter chooses its codeword as a function of both messages. 

We stress that our message splitting strategies are special cases of the partial cooperation approach 
in [6, Theorem 7]. Since a direct application of the general coding strategy in [6, Theorem 7] to Gaussian 
MIMO relay channels would require a computationally intensive optimization over several auxiliary random 
variables, we consider simplified coding approaches and obtain closed-form achievable rate expressions. 

We propose lower bounds on the capacity of the MIMO relay channel by utilizing transmit-side message 
splitting. In particular, we consider both superposition coding and precoding at the transmitter. For the 
case of precoding in a Gaussian MIMO relay channel, dirty-paper coding [23] is employed at the transmitter. 
Our proposed lower bounds obtained via a combination of transmit-side message splitting and block-Markov 
encoding improve on the lower bounds from [24] that are obtained by a non-cooperative transmit strategy 
that does not employ block-Markov encoding. The block-Markov encoding that we employ differs from the 
approach in [6] in that the relay cooperates with the transmitter over two consecutive transmission blocks 
to transmit only one of the transmitter's two messages. The non-cooperative approach in [24] is actually 
a special case of our proposed strategies. We also perform a simple numerical analysis that illustrates how 
the achievable rate from our precoding approaches depends on the exact channel state and not just on the 
channel norms. 

The rate bounds in this paper along with an initial version of the numerical results in Fig. |3]were initially 
presented in [28]. This paper contains the full proofs of some of the key rate bounds, which lends valuable 
insights on the key encoding and decoding mechanisms for transmit-side message splitting in the MIMO 
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relay channel. We have also obtained revised numerical results for Fig. HI In addition, we have added Fig. 
[5] and Fig. [6l which illustrate the impact of system topology on the derived rate bounds. 

This paper is organized as follows: In Section II we describe the system model. Section III reviews 
the upper and lower bounds on capacity from [24] for the Gaussian MIMO relay channel. In Section IV, 
we present our message splitting strategies for Gaussian MIMO relay channels along with their associated 
achievable rates. Numerical results are given in Section V. We conclude the paper in Section VI. The 
appendix contains rigorous derivations of some of the achievable rate expressions. 

We use boldface notation for matrices and vectors; uppercase notation is used for matrices while lowercase 
notation is used for vectors. E represents mathematical expectation. Re(a;) denotes the real part of a complex 
number x. For a matrix A, A^, tr(A) and det(A) denote the transpose conjugate, trace, and determinant, 
respectively of A while A ^ means that A is positive semi-definite. SNR represents signal-to-noise ratio. 
Ik denotes the K x K identity matrix. We use CJ\f{h, C) to represent the circularly symmetric complex 
Gaussian distribution with mean b and covariance matrix C. For a set TZ, \\TZ\\ denotes the cardinality of TZ. 

2 System Model 

Consider the Gaussian MIMO full-duplex relay channel illustrated in Fig. [2j Let xi and X2 be the Mt 
X 1 and Mr x 1 transmitted signals from the transmitter and the relay. Let y and be the Nt x 1 and 
Nr X 1 received signals at the receiver and the relay. Define Hi, H2, and H3 as Nr x Mt, Nt x Mt and 
Nt X Mr channel gain matrices. Define z and zi as independent Nt x 1 and Nr x 1 circularly-symmetric 
complex Gaussian noise vectors with distributions CM{0, ItvJ and CM{0, Iat^). 

We assume that the transmitter is subject to a power constraint E(x|xi) < Mt and that the relay is 
also subject to a power constraint E(x2X2) < Mr. We also assume that the relay has two sets of antennas, 
with one set for the receiver and one for the transmitter, so it operates in a full-duplex mode. The relay 
also cancels out interference from its transmitter array at its receiver array. In addition, we assume that all 
channel matrices are fixed and known at all three terminals and that z and zi are uncorrelated with xi and 
X2. We do not consider fading channels in this paper. 

We define parameters related to the SNR at the receiver and at the relay as 71 = SNRi/Mt, 72 = 
SNR2/ Mt, and 73 = SNR^/Mr where SNRi and SNR2 are the expected SNR values for xi after fading 
at each receive antenna at the relay and the receiver, and SNR^ is the expected SNR for X2 after fading at 
each receive antenna at the receiver [12]. 
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With these definitions, the received signals at the relay and at the receiver are 

Yi 

y 



.^Hixi + zi 

y^H2Xl + y^H3X2 + Z. 



(1) 



2.1 Weak Typicality 



Our proofs in this paper rely on the notion of weak typicality [32]. Let X ~ p{x) be a random variable. 
The set A^^\x) of weakly typical sequences x", where p{x'^) = HILi^'l^*) 

1 



n 



■\ogp{x") - H{X) 



< e 



Note that our results for discrete memoryless channels based on finite alphabet codebooks can be general- 
ized in a straightforward manner to Gaussian channels with Gaussian codebooks. This generalization is based 
on applying weak typicality to continuous distributions that are subject to second moment constraints [32]. 



3 Background 

It was shown in [6, Sec. Ill] that the capacity C of a general relay channel is upper-bounded as 

C< max min{/(Xi;y,yi|X2),/(Xi,X2;y)} 



(2) 



where the first term in the minimization is the rate from the transmitter to the relay and the receiver and 
the second term is the rate from the transmitter and the relay to the receiver. 

Now let xi and X2 be random vectors with mean zero and covariance matrices Sjj = E(xjxt). The authors 
of [24] established the following capacity upper bound and lower bound for the case where the channel gains 
are fixed and known at each terminal. 

Lemma 3.1. [24, Sec. Ill] An upper hound on the capacity of the Gaussian MIMO relay channel is given by 

< Copper = max min(Cf , C^) (3) 
where trfl^u) < Mt, tr (B22) < Mr and 



C7f ^ log 



dct Im, + (1 - p^) 













5:11 




•)1 


_^/7^i?2. 









(4) 



C§ ^ inf„>o log[det(JAr, + (72 + ^^/yrii)H2^iiH\ + (73 + a.Jjru)H^^22H\)]. 
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As stated in [24, Sec. IIIA], p represents the correlation between xi and X2. Also, a is a constant that 
arises from the vector- valued inequality in [24, Lemma 3.2]. In addition, Cf and C2 represent the maximum 
sum rate across the transmitter-side broadcast cut and receiver-side multiple-access cut, respectively, in the 
Gaussian MIMO relay channel. 

Lemma 3.2. [24, Sec. Ill] A lower hound on the capacity of the Gaussian MIMO relay channel is given by 

> CLer = max(QG, mm{C^ , Cf)), (5) 

where 

4 maxsi,log[det(J^, +72iJ2Sii4)] 

^ maxsiilog[det(J^,+7ii?iSni?l)] (6) 
Cf ^ maxs22 log[det(J^, + ^^H^Y.22HI{In, + ^2H2^\^h\)-^)] 

with 

^ arg max log[det(JAr, + 7ii?i5]n/fl)]. (7) 

Our objective is to use transmit-side message splitting to improve upon the bound in Lemma |3.2[ We 
outline this strategy in the next section. 

4 Transmit- Side Message Splitting 

Next we describe the transmission strategy that is employed in this paper. We divide the transmit message 
into two components, denoted by the random variables Wu and w^- Wu is the message that is decoded by the 
relay and is thus cooperatively sent by the transmitter-relay pair to the receiver, w^, however, is intended 
to be decoded only by the receiver, and thus is a source of "interference" at the relay that is known a-priori 
at the transmitter. 

We consider two classes of transmission strategies with this setup. The first is superposition coding, where 
codebooks for Wu and are determined separately and then simply superposed (added to one another) at 
the transmitter. The second strategy is to utilize precoding at the transmit end, where intuitively the 
transmitter attempts to mitigate the interference caused by to the desirable signal corresponding to Wu 
at the relay. For both strategies, the transmitter and the relay cooperate in block-Markov encoding of Wu- 

Note that the receiver must determine both Wu and to decode the transmit message. Thus, if Ru 
denotes the rate for the codebook corresponding to Wu and that for w^, the net achievable rate for both 
superposition coding and precoding \s R = Ru + Rv Assuming the receiver successively decodes Wu and w^, 
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the order in which they are decoded impacts their rates. In this paper, we use both decoding orders and 
choose the order that maximizes the overall rate. 

Let u and v be auxiliary variables representing the contribution of Wu and w^, respectively to xi. Define 
Su, and 1^X2 to be the covariance matrices of u, v and X2 respectively. Also, define 

r S„ E(ux^) 
_E(x2ut) S,, 

and B = [.^72112 ^/TijHs]. In this case, E(uu^) = In addition, define Xi, A2, U and V as the finite 
alphabets for xi, X2, u and v, respectively. 

4.1 Superposition Coding 

Consider the system illustrated in Fig. [3l Assume that the receiver attempts to decode Wu before 
decoding w^. Let Rsc,u be the achievable rate for this case. By applying the partial cooperation strategy 
of [6, Theorem 7] to this case, it can be proved that 

Rsc,u = sup {Rsc,u,l + Rsc,u,2) (8) 
p{xi,X2,u,v) 

where 

Rsc,u,i = mm{I{U;Yi\X2),I{U,X2;Y)) 
Rsc,u,2 = I{V;Y\U,X2) 
and the supremum is taken over all joint distributions 

p{xi,X2, U, v) = p{x2)p{u\x2)p{v)p{xi\u, v) 

on A'l X A'2 X Z// X V. In particular, I{U ; yi|X2) is the maximum signaling rate for Wu over the transmitter-to- 
relay link. Also, I{U,X2\Y) is the maximum signaling rate for Wu over the effective multiple-access channel 
from the transmitter and relay to the receiver. In addition, liy; Y\U, X2) is the maximum signaling rate for 
Wy over the transmitter-to-receiver link. 

For the Gaussian MIMO relay channel, we employ Gaussian codebooks for u and v at the transmitter. 
We prove in Appendix I A . II that 

'det (Itv, + 71H1 (e^ - E(ux|)5]-iE(x2ut) + 

I{U- Y,\X2) = log I ^ ^ 1 L I , (10) 

det [In, + 7iHiS„H{ 

'det ( In, + 72H2S^,H5 + BAB^' 



In, + 72M22.^±l^ 

I{U, X2;Y) = log I ^ I , (11) 

det iTVt +72H25]„HT 
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and 

I{V-Y\U,X2) = log(det(I;v, +72H25],h5)). (12) 

Now assume that the receiver attempts to decode Wy before decoding Wu- Let Rsc,v be the achievable 
rate for this case. By applying the partial cooperation strategy of [6, Theorem 7] to this case, it can be 
proved that 

Rsc,v = sup {Rsc,v,l + Rsc,v,2) (13) 
p(xi,X2,u,v) 

where 

Rsc,v,i = mm{I{U;Yi\X2)J{U,X2;Y\V)) ^ ^ 

(14) 

Rsc,v,2 = I{V;Y) 
and the supremum is taken over all joint distributions 

p{xi,X2, U, V) = p{x2)p{u\x2)piv)p{xi\u, v) 

on Xi X X2 X U X V. 

In this case our choice of Gaussian codebooks for u and v in a Gaussian MIMO relay channel yields 

'det (ijvt +72H25]^h5 + BABt' 



which is analogous to the rate in (I12p and 

I{U, X2; Y\V) = log(det(I^, + BABt)), (16) 

which is analogous to the rate in (jlip while 1{U; Yi\X2) is the same as in (jlOp . 

The objective is to choose the decoding order that yields a higher overall rate. We now state the following 
intuitively obvious result, which will not require a formal proof. 

Proposition 4.1. Let Rgc he the maximum signaling rate for the Gaussian MIMO relay channel where the 
transmitter employs superposition coding. Then 

where Cj^^^^ is given in Lemma \37B . 

By setting v = xi and u = 0, we can achieve C^- Also, by setting u = xi, v = and having the relay 
employ a codebook of the same cardinality as that of the codebook at the transmitter, we can achieve at 
least min(C|^,Cf ). 
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4.2 Precoding 

Instead of superposition coding, consider a strategy where the transmitter uses precoding to mitigate the 
interference caused by to the desired signal corresponding to Wu at the relay. Assume that the receiver 
attempts to decode Wu before decoding w^- Let Rpre,u be the achievable rate for this case. It is proved in 
Appendix IA.2I that 

Rpre,u — sup {Rpre,u,l ~l~ Rpre,u,2) (-^^) 
p{xi,X2,u,v) 

where 

Rpre,u,i = m\n{I{U-Yi\X2)-I{U-V\X2),I{U,X2;Y)) ^^^^ 

Rpre,u,2 = I{V;Y\U,X2) 

and the supremum is taken over all joint distributions 

p{xi,X2, U, V) = p{v)p{X2)p{u\x2)p{xi\u, v) 

on JYi X X2 xU xV. Note from the form of the joint distributions that u and v are correlated, which differs 
from the case of superposition coding. The transmitter selects u as a function of the known interference v 
on the transmitter-to-relay channel Hi. 

For the Gaussian MIMO relay channel, we employ Gaussian codebooks for u and v. In particular, we 
choose u = Gv -|- x'^^ and xi = x'^^ -|- v, where x'^^ and v are chosen to be independent. Thus we are employing 
dirty-paper coding at the transmitter and the objective is to choose G to maximize I{U; 1x1X2) - I{U; V\X2)- 
We define 5] ' , to be the covariance matrix of Xi given knowledge of X2. By following a procedure similar 
to that in [29, Appendix C], we have 

I{U; Yi\X2) - I{U; V\X2) = log(det(I;v. + TiHiE^. |^^h|)), (20) 

which is analogous to the rate in (fTUj) : 1{U, X2]Y) and 1{V;Y\U, X2) are the same as in pT|) and 
respectively. 

Now assume that the receiver attempts to decode before decoding Wu- Let Rpre,v be the achievable 
rate for this case. It is proved in Appendix lA. 31 that 

Rpre,v — sup (^Rpre,v,l ~1~ Rpre,v,2^ C^-^) 
p{xi,X2,u,v) 

where 

Rpre,v,i = mm{I{U;Yi\X2)-I{U;V\X2),I{U,X2;Y\V)) 

(22) 

Rpre,v,2 = ^) 
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and the supremum is taken over all joint distributions 

a;2, n, v) = p{x2)p{u, v\x2)p{xi\u, v) 

on Xi X X2 X U X V. 

In this case our choice of dirty-paper coding at the transmitter in a Gaussian MIMO relay channel results 
in Y), 1{U; I1IX2) - 1(^7; V\X2) and 1{U, X2;Y\V) being the same as in ([l5]), ([20]), and ([IS]) respectively. 

The objective is to choose the decoding order that yields a higher overall rate. We now state and prove 
the following result. 

Proposition 4.2. Let Rpre be the maximum signaling rate for the Gaussian MIMO relay channel employing 
dirty-paper coding at the transmitter. Then 

Rpre — ^^^(,Rpre,u: Rpre,v) ^ Rsc (^^) 

where Rsc is given in Proposition \4. 1\ 

Proof. We show that superposition coding is a special case of our preceding strategy. Without loss of 
generality, assume that Wu is decoded first at the receiver. Recall that 

Rpre,u = sup {mm{IiU;Yi\X2) - I{U;V\X2), (24) 

p{v)p(x2)p{u\v,X2)p{xi\u,v) 

I{U,X2;Y)) + I{V;Y\U,X2)). 

By considering the case where u and v are independent random variables, we find that p{u\v,X2) = p{u\x2) 
and I{U;V\X2) = 0. Thus, ([MD reduces to 

Rsc,u= sup {mm{I{U;Yi\X2),I{U,X2;Y))+I{V;Y\U,X2)). (25) 

p{x2)p{u\x2)p{v)p{xi \u,v) 

It immediately follows that Rpre > Rsc- □ 



5 Numerical Results 

We employ a simple example to demonstrate how transmit-side message splitting outperforms the bounds 
in [24, Sec. III]. We choose H2 = [1 0] and H3 = 1. We also choose Hi = [x y], where x,y £M, and constrain 
||Hi|| = 10. By considering Hi and H2 as two-dimensional vectors, we can define an "angle" G(Hi,H2) 
between them. We vary 0(Hi,H2) over the range [0,7r], where G(Hi,H2) is expressed in radians. As 
G(Hi,H2) — > 7r/2, the gain between the second transmit antenna and the relay's antenna, or y, increases. 
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Note that the norm constraint on Hi causes the gain between the first transmit antenna and the relay's 
antenna, or x, to decrease. 

We consider three system topologies. The first topology is where the transmitter, the relay, and the 
receiver are equidistant, and this is modeled by setting 71 = 72 = 73 in ([T]). The second topology is where 
the relay is closer to the transmitter than to the receiver, and this is modeled by setting 72 = 73 and 
71 = IO72 in ([1]). The third topology is where the relay is closer to the receiver than to the transmitter, and 
this is modeled by setting 71 = 72 and 73 = IO71 in ([T]). For all three topologies, we observed that the lower 
bound from [24, Sec. Ill] is 1 bits/s/Hz for all values of 0(Hi, H2), which results from our fixing H2 at [1 0]. 

We need to solve the optimization problems dH), ()13p . (jlSp and (|2ip to obtain the achieved rates for our 
message splitting strategies. We employ numerical direct search methods such as the Nelder-Mead algorithm, 
differential evolution and simulated annealing to solve ([8|), (fT3|) . (fT8]) and ([2T]) . 

Fig. [3] shows the rates that are achieved by our message splitting strategies along with the upper and lower 
bounds from [24, Sec. Ill] for the first topology. We see that the upper bound decreases as 0(Hi, H2) Tr/2 
radians. Also, as 0(Hi,H2) 7r/2, the transmitter uses more power on its second transmit antenna to 
exploit the rate benefits on the transmitter-to-relay link. This strategy, though, results in a loss of rate 
on the direct link since H2 is fixed at [1 0]. This leads to a monotonic decrease in the upper bound as 
e(Hi,H2) ^ V2. 

We see that the achievable rates via superposition coding and dirty-paper coding always outperform the 
lower bound of 1 bits/s/Hz. Also, we see that the achievable rate from dirty-paper coding is never less than 
the achievable rate from superposition coding. 

Fig. [5] shows the rates that are achieved by our transmit-side message splitting strategies along with the 
upper bound from [24, Sec. Ill] for the second topology. As in Fig. U we see that the upper bound and our 
achievable rates via message splitting monotonically decrease as 0(Hi,H2) — > 7r/2 radians. 

Fig. [6] shows the rates that are achieved by our transmit-side message splitting strategies along with both 
the upper and lower bounds from [24, Sec. Ill] for the third topology. Here, we see that the lower bound 
monotonically increases as G(Hi,H2) — > it/2 radians. Also, superposition coding performs comparably 
to dirty-paper coding over all angles in this case, whereas for the other two topologies dirty-paper coding 
generally significantly outperformed superposition coding. 
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6 Conclusion 

We derived new lower bounds on capacity for MIMO relay channels via transmit-side message splitting. 
Our proposed bounds improve upon the lower bounds that were introduced in [24]. In particular, our 
results show the benefits of employing the relay's assistance via superposition coding and precoding at 
the transmitter. Our results suggest that transmit-side message splitting should be an integral part of 
communication over MIMO relay channels, especially when the transmitter-to-relay link is strong relative to 
the transmitter-to-receiver and/or relay-to-receiver channels. 



A Proofs Of Rate Bounds 



A.l Establishing (fTIl 

We have I{U]Yi\X2) = h{Yi\X2) - h{Yi\X2,U). Since the transmitter employs superposition coding, we 
have 

yi = VtTHiXi + zi 

= ^/7rHi(u-Fv) -Fzi 
and since u and v are independent given X2, and v and X2 are independent, we have 



hiYi\X2) = hi^Ui{U + V) + Zi\X2) 

= log((27re)^'- det(7iHi(5],|,, + J:,)u\ + I^J) 

and since zi is independent of u, v and X2 we have 

hiYi\X2,U) = h{,/^i{U + V) + Z^\X2,U) 

= /i(^7rHiF + Zi|X2,[/) 

= /i(^7lHiF + Zi) 

= log((27re)^'- det(7iHi5],Hl + %,,)). 

Now we note that log((27re)*^* det(5]„I^J) = h{U\X2) = h{U,X2) - h{X2) = 
log((2^e)*^*+^'-det(^)) - log((2^e)^'^'-det(S^2)) where 



A 



Efux 



^X2 



SO 



.E(x2ut) 

det(^) = det(S^2) • det(S„ - E(uxT)S^^iE(x2u'f)) 



(26) 



(27) 



(28) 
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and 



'u\x2 



Thus we have 



h{Yi\X2) = log((27re)^'- det(7iHi(: 



(29) 



and finally we obtain 



I{U;Yi\X2)=log 



det (l^, + 7iHi (S, - E(ux|)S-iE(x2ut) + S,) 



) 



) 



(30) 



det I^, +7iHiS„H 



A. 2 Achievability Proof of m 

This proof relies on the concept of backward decoding, which was introduced in [30]. 

A. 2.1 Block Markov Encoding and Backward Decoding 

Consider B+1 blocks of transmission, each consisting of n symbols. A sequence of B messages, Wi = 
{wu,i, Wy^i) £ W, i = 1,2,. . . ,B, each selected independently and uniformly over W is to be sent over the 
channel in n(B + 1) transmissions. 

The senders use a triply-indexed set of codewords: 



W2u is sent cooperatively by both senders in block i to help the receiver decode the previous message 
'Wu,i-i- To be more specific, the message W2u is the relay's estimate of the transmitter's message Wu in the 
previous block. See Tabled] for details. 

Backward decoding is employed at the receiver to decode 'Wu,i and w^^i. Thus, after block B+1, y{B + 1) 
is used to decode 'Wu,b and Wy^B- Then, y{B) and Wu,b are used to decode Wu,b-i and Wy^B-i- Next, 
y{B — 1) and Wu,b-i are used to decode Wu,b-2 and Wy^B-2- The process continues until y(2) and Wu,2 are 
used to decode Wu,i and w^^i. 

A. 2. 2 Generation of Random Code 

Yyk p{v)p{x2)p{u\x2)p{xi\u,v). Generate at random 2"^" i.i.d. sequences according to ~ nr=i^^(^i)) 
and index them as v"'{wy),Wv G {1,2, . . . ,2"-^"}. Generate at random 2"^" i.i.d. x'2 sequences according to 



C 



{{v'^{wy),u'^{k, W2u),X2{w2u)) : G {(/>, 1, 2, . . . , i 
A; G {1, 2, ... , 2"(^(t^;^il^2)-5(e))}^ g {^^ 1^2,... 




(31) 
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~ nr=i Pix2i), and index them as x'^{w2u),W2u €{1,2,..., 2"^"}. For each x^{w2u), generate 2"(^(^;^il-^2)-e) 
conditionally independent ti" G Ae^\u) sequences according to p{u\x2), and partition them into 2"^" equal- 
sized bins for each (w2u)- This defines the random codebook C = {{v"'{wv), u"'{k, W2u), (^2u))}- Finally, 
if X2) G Ai"'\ generate the codeword x'^ via -y"). 

The bin partitioning of the sequences implicitly defines a function T where T : u^{k,W2u) — Wu- 
Here, k G {1, 2, . . . , 2"(^(^;^i|-^2)-6)}^ G {</>, 1, 2, . . . , 2"-^^"}, and G {</>, 1, 2, . . . , 2"-^"}. For example, 
J^(m'"(1,u;2«)) = r{u''{2,uj2u)) = ... = J^K(2"(^('^'^il^2)-^"-^),w;2n)) = u;„ = 1. We see that T maps 
sequences vP'{k,W2u) into their corresponding bin (and hence, message) indices Wu- Since there is a one-to- 
one mapping between a sequence vP'{k,W2u) and its bin Wu, we can also write J^{vP'{k,W2u)) as {F{k,W2u)- 

A. 2. 3 Encoding 

Let Wu,i G {1, 2, . . . , 2"'^"} and w^j^j G {1, 2, . . . , 2""^"} comprise the new message to be sent in block i. 
Then, select any u"'{k,Wu^i-i) in bin Wu^i such that (2:2 ^"(/s, ^"(wu^j)) G Ag"^ if this triplet 

exists. Use the selected vP along with v'^{wy^i) to generate x^ via w") and transmit this x^. 



Assuming that the relay estimated Wu,i-\ for Wu,i-i in block i - 1, then the relay sends X2{wu,i-i) in 
block i. 

A. 2. 4 Encoding and Decoding Error Analysis 

We first perform an error event analysis for the encoding stage. 

Encoding stage: The transmitter looks for a u^{k,Wu,i-i) such that 

a sequence u^{k,Wu,i-i) satisfying this criterion can be found, 
then the relay declares T{u'^{k,Wu,i-i)) as Wu,i- Here, Eoi is the event where (-u", 'y"'(u;^,i), X2 ^ 
A^^^u^ such that J^{vP) = Wu,i; we have 



Here, P{{x'l,u'^{k,Wu,i-i 



),^"K0)e4"^)>l 



— e. 



P{Eoi) 



P((u,V,X2K,i_l))^4")) 

J]p(t;")P((u,t;-,X2K,,-i 



)) ^ 4"^) 



V 



^pK)(l-P(K,^;",X2K,i-i 



)) e 4"^)) 



2n(I(U;Yi\X2)-Ru-e) 



< e 



_2n(I(U;Yl\X2)-Ru-e)2-n{I(U;V\X2)+e) 



(32) 
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which is arbitrarily small for n sufficiently large if 

Ru<I{U;Yi\X2)-I{U;V\X2)-2e. (33) 

We note that ([32]) follows from the following two facts: 

i) 2-"(^(f^;^l^2)+e) < p((^,",^;«,x2K,^-l)) S A^^) and 

ii) (1 - xf < e-'"' for < X < 1 and k>l. 

Thus, Wu^i = Wu,i with P{EQi) arbitrarily small if n is sufficiently large and if 

Ru < HU; Yi\X2) - I{U- V\X2). (34) 

Note that for Wu,i and tf^^j, we perform backward decoding at the receiver, though we perform block- by- 
block decoding at the relay. The following analysis is for the case where the receiver attempts to decode Wy, 
before decoding Wy. 

Here, we proceed through three decoding steps. We employ the concept of weak typicality. Define the 
following error events: 

• Eli as the event that {u^{ki,W2u,i),X2{'W2u,i),yi{i),y"'ii)) ^ where y"(«) and y'^{i) are the obser- 
vations by the relay and receiver, respectively in block i. 

• Emi as the event that there is an error in block i at decoding step m — 1 for m = 2,3,4. 

Thus, the overall probability of error Pi"^ = -P(Um=o-^™) — Ylm=o ^(^'mi)- ^^^t note that for n 
sufficiently large, P{Eii) < e by the asymptotic equipartition property (AEP). Now we bound P{Emi) for 
m = 2,3,4 as follows. 

Decoding step 1: Upon observing (i), the relay receiver declares that Wu,i = Wu is sent if it is the 
unique index such that {u'^{ki,Wu,i-i), X2{wu,i-i),yi{i)) S Ai"'\ where u"'{ki,Wu,i-i) is in bin Wu,i- Here, 
E2i is the event that 3wu / Wu,i such that {u'^{ki,Wu,i-i),X2{wu,i-i),yi{i)) G Ti"\ where u'^(ki,Wu,i~i) is 
in bin Wu- Now, for Wu / Wu,i, 

P{E2i\Wu) = P{{u{k,Wu,i-l),X2{Wu,i-l),yiii)) A'f'^) 

= X] p{u"-{ki,Wu,i-i),yi{i)\x2{wu,i-i) 

= Yl p(n"(A:i,u;„,i_i)|x5(t(;n,i-i))p«(0k2(^^«,i-i))(35) 

< A(")(C/,yi|4)| 2^"(^(^l^2)-€)2-n(/^(ri|X2)-€) 
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< 2-niHiU\X2)+HiYi \X2)-HiU,Yl\X2)~3e) 
^ 2-niH(Yi\X2)-H{Yi\X2,U}-3e) 
^ 2-niI{U;Yi\X2)-3e) 

where (j35p follows from the fact that yi{i) and u{ki,Wu,i-i) are independent for Wu ^ Wu,i- Thus, we have 

P{E2i) < 2"-^" • 2-"(^(f^;'^il^2)-3^) (36) 

and so Wu,i = Wu,i with P{E2i) arbitrarily small if n is sufficiently large and if 

Ru<I{U-Y^\X2). (37) 

Decoding step 2: Backward decoding is employed to estimate Wu^i at the receiver. Assume that the 
receiver has estimated Wu,i+i for Wu,i+i- Now, the receiver looks for a unique W2u such that 

In) 

{u^{ki,W2u),X2(w2u),y^{i)) £ Ae , where J^{ki,W2u) = Wu,i+i- It then declares Wu = W2u if this unique 
W2u exists. Here, E^i is the event that 7^ Wu,i such that {u'^{ki,Wu),X2{wu),y"'{i)) G where 
J^{ki,Wu) = Wu,i+i- Now, for Wu / Wu,i, 

P{E3i\wu) = P((u,X2(u;„),y(i)) G AW) 

E (38) 

(u"(A;i,t5„),x5(ro„),y"(i))gA^"'{[/,X2,y),u)u^i«„,i,JP(A:i,w)u)='u}u,i+l 

p(ti"(/ci,u)„),4(7Z)„),y"(i)) 



E 

p(tx"(A:„«)„),x^(u}J)p(y"(i)) (39) 



(M"(A:i,t«„),xJ(to„),y"(i))GA^"'(Lf,X2,y),-!iju7^iOu,i,.7^{fci,u'u)='!l>u,i+l 



< 



Ai''\u,X2,Y) 2-"(^(^'^2)'^)2'"(^(^)~') 

2-"(^(^'^2)+/^(l')--f/(C/,X2,y)-3£) 
^ 2-n(IiU,X2;Y)-3e) 

where ([5^]) follows from the fact that y(i) and (u{ki,Wu),X2{wu)) are independent for Wu 7^ ifu.j- Thus, we 
have 

P{E3i) < 2"-^" • 2-"(^(^'^2'^)-3^) (40) 
and so = Wu,i with P{E3i) arbitrarily small if n is sufficiently large and if 

Ru<IiU,X2;Y). (41) 
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Now, we combine ([3^ . ([371) and ([TO to obtain 

Ru < mm{{I{U;Yi\X2) - I{U;V\X2)), I{U,X2;Y)). (42) 

Decoding step 3: Backward decoding is also employed to estimate Wy^i at the receiver. Assume that the 
receiver has estimated Wu,i+i for Wu,i+i- Recall that the receiver has estimated ■Wu,i for Wu,i in decoding 
step 2. Now, the receiver looks for a unique Wy such that {u^{ki,Wu,i), X2{wu,i),y^{i),v'^{wv)) £ Ai^\ where 
J^{ki,Wu,i) = 'Wu,i+i- It then declares Wy = Wy if this unique Wy exists. Here, E^i is the event that 3wy / Wy^i 
such that {u'^{ki,Wu,i),X2{wu,i),y'^{i),v"'{wy)) G Ai"'\ where J^{ki,Wu,i) = Wu^+i- Now, for Wy / Wy^i, 

PiE4i\wy) = P((u,X2(iD,,,),y(0,v(w).)) 

E 

(ii"(t«„),y"(j))eA^"'(y,y|«",x5),-i«„7^i«„,i,jp(fci,i«„,i)='i'u,i+i 
p{v'^{wy), y''{i)\u''{ki,Wu,i),x^{wu,i)) 

E 

(ii"(w„),y"(j))eA^"'(y,y|«",a;5),w„7^i«„,i,jc-(fci,i«„,i)=«,„,i+i 

(iS^ ) I ( /ci , j) , {wu,i))p{y'' (i) I ( fci , , a;^ ) ) (43) 



< 



-n(H(y|C/,X2)-e)2~"(H(y|l/,X2)-e) 
^ 2-niH(V\U,X2)+H{Y\U,X2)-H{V,Y\U,X2)-3e) 
^ 2-niI{V;Y\U,X2)-3t) 

where (j43p follows from the fact that y{i) and v(tZ'^) are independent for Wy ^ Wy^i. Thus, we have 

and so Wy = Wy^i with P(E4^i) arbitrarily small if n is sufficiently large and if 

Ry < I{V;Y\U,X2). (45) 

A.3 Achievability Proof of (|2T|) 

This proof also relies on the concept of backward decoding. Apply the code generation and encoding 
procedures from Section [A]2j Note that in this case, backward decoding is employed at the receiver to decode 
'Wu,i, not both Wu,i and Wy^i. Thus, after block B+1, y{B + 1) is used to decode Wu,b- Then, y{B) and Wu,b 
are used to decode Wu,b-i- Next, y{B — 1) and Wu,b-i are used to decode Wu,b-2- The process continues 
until y(2) and Wu^2 are used to decode Wu,i- The receiver can use block-by-block decoding to decode Wy^f, 
thus, Wy^i can be decoded in block i after y{i) is received, where i = 1,2,. . . ,B. 
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A. 3.1 Encoding and Decoding Error Analysis 

We first perform an error event analysis for the encoding stage. 

Encoding stage: The analysis for this stage is similar to the analysis for the encoding stage in Section 
IA.2[ Thus we have 

Ru<I{U;Yi\X2)-I{U;V\X2). (46) 

Note that for Wu,i, we perform backward decoding at the receiver, though we still perform block- by-block 
decoding at the relay. We also perform block- by-block decoding at the receiver for w^^i. 

Once again, we proceed through three decoding steps and employ the concept of weak typicality. Define 
the following error events: 

• Eli as the event that {u^{ki,W2u,i),X2{w2u,i),yi{i),y^{i)) ^ ^e"\ where (?) and are the obser- 
vations by the relay and receiver, respectively in block i. 

• Emi as the event that there is an error in block i at decoding step m — 1 for m = 2,3,4. 



Thus, the overall probability of error Pi"^ = -P(Um=i — X]m=i^(^™)- ^^^^ note that for 



n 



sufficiently large, P{Eii) < e by the asymptotic equipartition property (AEP). Now we bound P{Emi) for 
m = 2,3,4 as follows. 

Decoding step 1: Upon observing y^{i), the receiver declares that Wy^i = Wy is sent if it is the unique index 
such that {v"- (wy) , y"- (i)) G Ai"'\ Here, E2i is the event that 3wv ^ w^^i such that {v"' (wy) , y"' (i)) G Ai"'\ 



Now, for Wv 7^ Wv. 



11 



P{E2^\w,) = P((v(«),),y(i))G4")) 



p{v-{w.),y-{i)) 



(t;"(t«,),j/"(i))GA^"'{V,y),«)„^t«„,i 



< 



P{v''{wv))p{y''ii)) (47) 
A^^'^ {V, Y) I 2-"(^(^)-^)2~"(^(^)-') 



^ 2-n{I(V;Y)-ie) 

where (j47p follows from the fact that y(i) and 'v{w-u) are independent for Wy ^ w^^i. Thus, we have 

P{E2i) < 2"-f^" • 2-"(^(^;^)-3^) (48) 
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and so Wy^i = Wy^i with P{E2i) arbitrarily small if n is sufficiently large and if 

Ry<I{V;Y). (49) 

Decoding step 2: The analysis for this step is similar to the analysis for decoding step 1 in Section [A.2I 
Thus we have 

Ru<I{U-Yi\X2). (50) 

Decoding step 3: Backward decoding is employed to estimate Wu^i at the receiver. Assume that the 
receiver has estimated Wu,i+i for Wu,i+i- Recall that the receiver has estimated Wy^i for Wy^i in decoding 
step 1. Now, the receiver looks for a unique W2u such that {u'^{ki,W2u),X2{w2u),y^{'i):V^{wy^i)) G A^^\ 
where T{ki,W2u) = Wu,i+i- It then declares Wy = W2u if this unique W2u exists. Here, is the event 
that 3wu / Wu,i such that {u^''{ki,'Wu), X2{wu),y'^{i),v^^{wy^ i)) G aI^\ where J^{ki ,Wu) = Wu,i+i- Now, for 

Wu / Wu,i, 

P{E4i\wu) = P((u,X2(u;„),y(i),v(rZ;„,i)) G 4")) 

^ E 

pivl'iki, Wu):x':^{wu),y''{i)\v''{wy^i)) 



E 

p(M"(fci,i()„),x^(i()„)|t;"(u)^,i)p(y"(i)|f"(?I}^,,,) (51) 



< 



a(")([/, ^2, y|w") 2-"(^('^'^2|v)-e)2-n(H(y|y)-E) 

< 2-'n{H{U,X2\V)+H{Y\V)-H(U,X2.Y\V)-'i€) 
^ 2-n{I{U,X2]Y\V)-U) 

where ([5T]) follows from the fact that y{i) and {\i{ki,Wu)-,'^2{wu)) are independent for Wy ^ Wu,i- Thus, we 
have 

P{Eii) < 2"^" • 2-"(^(f^'^2;y|V)-3.) (52) 
and so Wy = Wy,i with P{E/^i) arbitrarily small if n is sufficiently large and if 

Ry<I{U,X2;Y\V). (53) 
Now, we combine ([I6|) . (j50|) and ([53|) to obtain 

i2„ < min((/(?7;yi|X2) -/([/; y|X2)),/(?7,X2;y|F)). (54) 
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Figure 1: Discrete memoryless relay channel. 




Figure 2: Gaussian MIMO Relay Channel. 




Figure 3: Gaussian MIMO relay channel with superposition coding. 
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Comparison of message splitting with Wang et. al for norm(H ) = 10 
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Figure 4: Achievable rate results for the case where the transmitter, the relay, and the receiver are all 
equidistant from each other, or 71 = 72 = 73- 



Comparison of precoding mettiods witti Wang et. al for norm(H1 ) - 1 
2.35 1 ^ ^ ^ ^ ^ ^ ^ ^ 




20 40 60 80 100 120 140 160 180 
Angle between H2 and HI (degrees) 



Figure 5: Achievable rate results for the case where the relay is closer to the transmitter than to the receiver, 
or 72 = 73 and 71 = IO72. 
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Figure 6: Achievable rate results for the case where the relay is closer to the receiver than to the transmitter, 
or 7i = 72 and 73 = IO71. 
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